Skip to main content

All Questions

5votes
1answer
49views

How to do Exploratory Data Analysis when my response variable is binary?

I am doing a multilevel regression, and my response variable is binary (presence of females on a tech board). all the EDA methods i know are about plotting correlation, but this as this is a binary i ...
Anya's user avatar
0votes
1answer
96views

Permutation importance question - all zero for the features

I have the following code: ...
Victorsmoreschi's user avatar
1vote
2answers
2kviews

Learning from aggregated data

Online and in the literature there seems to be a general consensus that training a machine learning model using aggregated data is harder and/or fundamentally different from training on raw event data....
dendog's user avatar
0votes
1answer
335views

How to cluster based on sensor data? - My first data science job

I'm on my first (real), data, programming job. As everyone can imagine, this can be quite hard and I learn a lot from it, given I am a data science student in university. However, I am completely ...
Martijn's user avatar
-3votes
1answer
41views

What to do if your model's prediciton result wrong because of unlucky?

Have you ever had a situation where your model backtested with very good with historical data, and you also felt that your model was very logical? But when put it into practice case to predict the ...
Quang Hoàng Minh's user avatar
0votes
0answers
26views

Can Hip-Hop/music trend be estimated?

How can I determine how an "evolutionary" music album affects the development of its genre? Only two perspectives I can come up with: 1. The effect on the number of songs before and after(...
Hanson Zhao's user avatar
0votes
0answers
19views

Where should I find electrolytic capacitor ageing data

I am trying to get a dataset of Electrolytic capacitors ageing and I am not being able to find one that shows the ripple current and the voltage in order to calculate its Equivalent Series Resistance (...
Jan Cabrera's user avatar
0votes
1answer
28views

How do I decide the frequency of data capture for modeling? How does it affect my final model?

I plan to capture data to predict energy consumption in a food processing plant. I want to capture production details such as how much each category of food is produced, what is the machine's output, ...
NAS_2339's user avatar
0votes
2answers
423views

How match output (pred value) to input value

I'm working with data(with 4 columns which are p(product), M(name of the store)), I want predict the demand of store for that I sued SVR on the data by theses formulation: ...
ramin's user avatar
1vote
1answer
23views

Is there enough datapoints to make a reasonable predcitction?

I am planning to use this dataset (for a school project) in order to determine the most important features in predicting whether a student will receive a placement. Further I would like to create a ...
user111510's user avatar
2votes
0answers
39views

How to aggregate Weather Data from county to state level? [closed]

I am doing a prediction of forest fire risk using the weather data and the fire incident data. I have data related to different county and the weather data on the day of the incident. As the incident ...
Shiv Kumar Ganesh's user avatar
0votes
1answer
35views

How to adjust/smooth a certain number using constants or rules

Hi, I am handling a dataset with a customer purchase history. The field ord_cnt represents the purchase without coupon usage, and cpn_ord_cnt represents the purchase with coupon usage. There are two ...
Donsuke's user avatar
1vote
0answers
23views

Fit model function out defined data range

I have asked this on SO but it has not been well accepted because it seems to be more about data science than programming. Let's say I have a set of data ...
Py-ser's user avatar
2votes
2answers
4kviews

What is the difference between a data-driven model and an empirical model?

Are they the same? Empirical models, per Wikipedia, are any kind of (computer) modelling based on empirical observations rather than on mathematically describable relationships of the system ...
user3707850's user avatar
1vote
1answer
1kviews

How to predict consumer purchase in next 6 months?

I'm working on a model to predict a customer as being 'in-market' for a product in the next 6 months. The dataset has a wealth of information like lifestyle and demographic variables and previous ...
CM23's user avatar

153050per page
close